SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Tsien et al., Roger Y. et al. 



(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(3) STREET: 4225 Executive Square, Suite 1400 

(C) CITY.- La Jolla 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92037 

(V) COMPUTER READABLE FORM: 
H (A) MEDIUM TYPE: Floppy disk 

™ (B) COMPUTER: IBM PC compatible 

W (C) OPERATING SYSTEM: PC - DOS /MS -DOS 

Qj (D) SOFTWARE: Patentln Release #1.0, version #1.30 

"'^ (vi) CURRENT APPLICATION DATA: 

U, (A) APPLICATION NUMBER: UNKNOWN 

(B) FILING DATE: 1S-AUG-1997 
%S (C) CLASSIFICATION: 

s«(viii) ATTORNEY/AGENT INFORMATION : 
Us (A) NAME: Haile, Lisa A. 

(B) REGISTRATION NUMBER: 38,347 

(C) REFERENCE/DOCKET NUMBER: 072S7/05S001 

?K (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619/678-S070 
P| (B) TELEFAX: 619/678-5099 

m " 

HI INFORMATION FOR SEQ ID NO:l: 

ill (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 716 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..714 



(xi> SEQU12NCE DESCRIPTION: SEQ ID NO:l: 

ATG ACT AAA GGA GAA GAA CTT TTC ACT GCA GTT GTC CCA ATT CTT GTT 48 
Met Ser Lys Gl.y Glu Glu Leu Phe Thr Ala Val Val Pro lie Leu Val 
15 10 IS 

GAA TTA GAT GGT GAT GTT AAT GGG CAC AAA TTT TCT GTC AGT GGA GAG 96 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
3!0 * 25 30 

GGT GAA GOT GAT GTA ACA TAC GGA AAA CTT ACC CTT AAA TTT ATT TGC 144 
Gly Glu Gly As\p Val Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
V 35 40 45 

ACT ACT GGA A7A CTA CCT GTT CCA TGG CCA ACA CTT GTC ACT ACT TTC 192 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

TCT TAT GGT GIT CAA TGC TTT TCA AGA TAC CCA GAT CAT ATG AAA CGG 240 
Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 
65 70 75 80 



CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT GTA CAG CAA AGA 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Gin Arg 
85 90 95 

ACT ATA TTT TTC AAA GAT GAC GGG AAC TAC AAG ACA CGT GCT GAA GTC 
Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

AAG TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG TTA AAA GGT ATT 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
lis 120 125 

GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAT AAA TTG GAA TAC AAC 
Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA CAA AAG AAT GGA 
Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
145 ISO 155 ISO 

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT GAA GAT GGA AGC GTT 
.lie Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser Val 

!"*■ 165 170 175 

gAA CTA GCA GAC TAT TAT CAA CAA AAT ACT CCA ATT CTC GAT GGC CCT 
p$n Leu Ala Asp Tyr Tyr Gin Gin Asn Thr Pro He Leu Asp Gly Pro 

%| :L80 185 190 

fatC CTT TTA CCA GAC AAC CAT TAC CTG TCC ACA CAA TCT GCC CTT TCG 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 

J* 195 200 205 

AjffA GAT CCC J LAC GAA AAG AGA GAC CAC ATG GTC CTT CTT GAG TTT GTA 
Lys Asp Pro Asn Glu Ly3 Arg A3p His Met Val Leu Leu Glu Phe Val 
a 210 215 220 

AO GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA 
g|r Ala Ala Gly lie Thr His Gly Met Aso Glu Leu Tyr Lys 
225 230 235 



S> INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 
(3) TYPE: amino acid 
CO) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Lys c:iy Glu Glu Leu Phe Thr Ala Val Val Pro He Leu Val 
is 10 IS 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 2S 30 

Gly Glu Gly Asp Val Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cy3 
35 40 45 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
SO 55 60 

Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp Hi3 Met Lys Arg 
65 70 7S 80 

His Asp; Phe Ihe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Gin Arg 
85 90 95 

Thr He Phe Ihe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Glv He 
115 120 125 



(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 
? 5i! (A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 
!jf (C) STRANDEDNESS : single 

CJ (D) TOPOLOGY: linear 



>H (ix) FEATURE: 

(A) NAME/KEY: CDS 
%| (B) LOCATION: 1..720 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Qs GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG 
Vim Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

:.; :> 240 245 250 

GggC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG TTC AGC GTG TCC GGC 

Wl Glu Leu Aso Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
23S ' 260 265 270 

GAG GGC GAG CIGC GAT GCC ACC TAC GGC AAG CTG ACC CTG AAG TTC ATC 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Ly3 Phe He 
275 2B0 285 

TGC ACC ACC GGC AAG CTG CCC GTG CCC TGG CCC ACC CTC GTG ACC ACC 
Cys Thr Thr Cily Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

::30 295 300 

TTC GGC TAC CiGC GTG CAG TGC TTC GCC CGC TAC CCC GAC CAC ATG AAG 
Phe Gly Tyr (fly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
305 310 315 

CAG CAG GAC 'TTC TTC AAG TCC GCC ATG CCC GAA GGC TAC GTC CAG GAG 
Gin Gin Asp ]>he Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
320 325 330 

CGC ACC ATC irTC TTC AAG GAC GAC GGC AAC TAC AAG ACC CGC GCC GAG 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

335 340 345 3S0 

GTG AAG TTC GAG GGC GAC ACC CTG GTG AAC CGC ATC GAG CTG AAG GGC 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

355 * 360 365 

ATC GAC TTC <VAG GAC GAC GGC AAC ATC CTG GGG CAC AAG CTG GAG TAC 
He Asp Phe Lys Asp Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
*s« 370 375 380 

AAC TAC AAC .\GC CAC AAC GTC TAT ATC ATG GCC GAC AAG CAG AAG AAC 
Asn Tyr Asn iSer His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
385 390 395 

GGC ATC AAG (3TG AAC TTC AAG ATC CGC CAC AAC ATC GAG GAC GGC AGC 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
400 405 410 



Val 
415 


CAG 
Gin 


CCC 
Pro 


GCC 
Ala 


GAC 
Asp 


CAC 

Hia 
420 


TAC 

Tyr 


CAG 
Gin 


CAG 
Gin 


AAC 
Asn 


ACC 
Thr 
425 


CCC ATC GGC GAC GGC 
Pro lie Gly Asp Gly 
430 


576 


CCC 
Pro 


GTG 
Val 


CTG 


CTG 


CCC 
435 


GAC 


AAC 


CAC 
His 


TAC 


CTG 
440 


AGC 
Ser 


TAC CAG TCC GCC CTG 
Tyr Gin Ser Ala Leu 
445 


624 


AGC AAA 
Ser Lys 


GAC 
Asp 


CCC 
Pro 
4S0 


AAC 
Asn 


GAG 
Glu 


AAG 
Lys 


CGC 
Arg 


GAT 
Asp 
455 


CAC 
His 


ATG 
Met 


GTC CTG CTG GAG TTC 
Val Leu Leu Glu Phe 
460 


672 


GTG 

Val 


ACC 
Thr 


GCC 

Ala 

465 


GCC 
Ala 


GGG 

Gly 


ATC 

lie 


ACT 

Thr 


CAC 
Hi3 
470 


GGC 
Gly 


ATG 

Met 


GAC 
Asp 


GAG CTG TAC AAG TAA 

Glu Leu Tyr Lys * 

475 


720 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids 
jhfc (B) TYPE: amino acid 

PI (D) TOPOLOGY: linear 

C9 (ii) MOLECULE TYPE: protein 

^1 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
Ujl S 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

|Jf 20 25 30 

~Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
3 3S 40 45 

^feys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
tf "50 55 60 

tphe Gly Tyr Gly Val Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 

llpS 70 75 80 

t&a Gin Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 



